Explaining inference queries with bayesian optimization
نویسندگان
چکیده
Obtaining an explanation for SQL query result can enrich the analysis experience, reveal data errors, and provide deeper insight into data. Inference seeks to explain unexpected aggregate results on inference data; such queries are challenging because may need be derived from source, training, or in ML pipeline. In this paper, we model objective function as a black-box propose BOExplain, novel framework explaining using Bayesian optimization (BO). An is predicate defining input tuples that should removed so of interest significantly affected. BO --- technique finding global optimum used find best predicate. We develop two new techniques (individual contribution encoding warm start) handle categorical variables. perform experiments showing predicates found by BOExplain have higher degree compared those state-of-the-art engines. also show effective at deriving explanations source training variety real-world datasets. open-sourced Python package https://github.com/sfu-db/BOExplain.
منابع مشابه
An Optimization Based Algorithm for Bayesian Inference
In the Bayesian statistical paradigm, uncertainty in the parameters of a physical system is characterized by a probability distribution. Information from observations is incorporated by updating this distribution from prior to posterior. Quantities of interest, such as credible regions, event probabilities, and other expectations can then be obtained from the posterior distribution. One major t...
متن کاملBayesian Nonparametric and Parametric Inference
This paper reviews Bayesian Nonparametric methods and discusses how parametric predictive densities can be constructed using nonparametric ideas.
متن کاملEstimation with Bayesian Inference
A probabilistic technique for the joint estimation of background and sources in high-energy astrophysics is described. Bayesian inference is applied to gain insight into the coexistence of background and sources through a probabilistic two-component mixture model, which provides consistent uncertainties of background and sources. The present analysis is applied on ROSAT PSPC data in Survey Mode...
متن کاملExplaining Type Inference
Type inference is the compile-time process of reconstructing missing type information in a program based on the usage of its variables. ML and Haskell are two languages where this aspect of compilation has enjoyed some popularity, allowing type information to be omitted while static type checking is still performed. Type inference may be expected to have some application in the prototyping and ...
متن کاملBayesian approach to inference of population structure
Methods of inferring the population structure, its applications in identifying disease models as well as foresighting the physical and mental situation of human beings have been finding ever-increasing importance. In this article, first, motivation and significance of studying the problem of population structure is explained. In the next section, the applications of inference of p...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the VLDB Endowment
سال: 2021
ISSN: ['2150-8097']
DOI: https://doi.org/10.14778/3476249.3476304